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Accelerated  life  tests  that  maximise  Shannon  information 

By  Nick  Polson,  Nozer  Singpurwalla  and  Isabella  VerdinellP 

Abstract 

The  current  literature  on  accelerated  life  tests  emphasizes  issues  of  inference  and  extrapolation 
under  a  given  design,  rather  than  the  optimal  design  of  an  accelerated  test.  In  this  expository  paper, 
we  outline  a  framework  for  a  coherent  approach  for  the  conduct  of  accelerated  tests.  Our  approach 
is  based  on  the  Bayesian  paradigm  for  experimental  designs  under  linear  models  and  involves  the 
specification  of  a  utility  function.  For  the  latter  we  adopt  Shannon  Information  between  predictive 
densities.  The  optimal  design  is  then  selected  via  the  principle  of  maximum  expected  utility.  We 
illustrate  the  approach  with  some  special  cases. 
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1  Introduction 


Let  5i,  S2, . . be  a  collection  of  environmentaJ  conditions  (stresses  in  engineering,  doses  in  biome¬ 
try)  at  which  it  is  possible  to  conduct  life  tests,  and  suppose  that  52  -<  ■  ■  -  ,  where  "‘5,  -<  Sj” 
denotes  the  fact  that  Sj  is  judged  to  be  more  severe  than  5,.  Let  5^  ^  Si,  denote  an  environmentaJ 
condition  at  which  it  may  or  may  not  be  possible  to  conduct  a  life  test;  5^  will  be  referred  to  as  the 
use  conditions  environment.  Let  X,-  denote  the  life  length  of  an  item  under  Si,  and  suppose  that 
whenever  5,-  Sj,  X,-  >*'  Xj,  where  “X  >*'  F”  denotes  the  fact  that  P{X  >  x)  >  P{Y  >  x)  for 
all  I  >  0.  Suppose  X,-  is  assessed  to  have  distribution  i^,(-  |  0,),  where  0,  is  a  vector  of  unknown 
parameters.  It  is  typical  [Mann,  Schafer  and  SingpurwaUa  (1974)]  to  let  F,(-  ]  0,)  =  F{-  \  0,),  for 
all  i,  and  to  assume  that  f  is  a  commonly  used  distribution  function  in  life  testing,  such  as  an 
exponential,  a  gjunma,  a  lognormal  or  a  Weibull. 

The  term  “accelerated  test”  refers  to  the  fact  that  the  conditions  under  which  testing  is  done 
are  more  severe  than  those  under  which  the  item  is  intended  to  be  used,  so  that  the  failure  time 
of  an  item  is  shortened.  The  dual  scenario,  namely  testing  under  conditions  less  severe  than  use 
conditions,  has  received  less  attention  because  the  two  are  conceptually  equivalent.  The  aim  of 
accelerated  testing  is  to  facilitate  an  ecirly  inference  about  Xu  bearing  in  mind  the  requirement 
that  Xu  Xi.  To  accomplish  the  above,  one  embarks  on  an  experiment,  called  an  accelerated 
life  test,  and  makes  some  assumptions  which  facilitate  an  analysis  of  the  data  from  the  experiment. 
Of  the  assumptions  that  are  made,  a  key  one  is  the  time  transformation  function,  whose  aim 
is  to  relate  0,-  and  5,-,  i  =  1,2,...,.  The  design  of  an  accelerated  life  test  calls  for  a  specification  of 
several  quantities,  namely  the  k  levels  of  testing,  say  Si,<  S2  <  ■  •  •  <  Sk,  k  >  I,  Ui,  the  number  of 
items  to  be  tested  under  5,,  r,,  the  number  of  failures  to  be  observed  under  5,-,  and/or  p,  the  time 
of  termination  of  the  life  test  under  5,-.  Clearly,  if  X,j  denotes  the  time  to  failure  of  the  j-th.  item 
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under  5,,  then  <  T,,  /  =  1 . A:,  and  ;  =  1, . . . ,  r,-  <  n,-.  Other  protocols  for  the  design  would 

involve  a  combination  of  the  above,  such  testing  until  inin(r,,  .Y,r,),  or  testing  under  Si  for  a 
duration  T,,  and  then  continuing  to  test  the  unfailed  items  under  5,+i  for  a  duration  T.+i,  and  so 
on.  The  latter  tests  are  called  step-stress  tests  [Shaked  and  SingpurwaUa  (1983),  DeGroot  and 
Goel  (1979)],  and  will  not  be  the  topic  of  discussion  here. 

To  date,  much  of  the  statistical  literature  on  accelerated  life  testing  has  focussed  on  issues  of 
inference  about  given  an  accelerated  life  testing  protocol,  rather  than  the  more  encompcissing 
problem  of  designing  an  optimal  protocol;  see,  for  example,  Meinhold  and  SingpurwaUa  (1987), 
BlackweU  and  SingpurwaUa  (1988)  for  recent  bibUographic  citations.  The  few  pubUshed  papers 
which  address  the  question  of  design  appear  to  lack  a  sound  foundation  and  consequently  fail  to 
recognize  the  important  role  played  by  the  time  transformation  function  in  developing  an  optimal 
design.  The  aim  of  this  paper  is  to  prescribe  a  framework  for  the  coherent  conduct  of  an  accelerated 
Ufe  test.  SpecificaUy,  what  is  needed  is  a  paradigm  for  specifying  a  choice  of  k  and  the  associated 
stress  levels  5i,...,5fc,  the  number  of  items  to  be  tested  ni,...,ni,  the  test  truncation  times 
Ti,  r2, . . . ,  Tfc,  and  the  censoring  values  Subsequent  work  along  these  Unes  would  also 

involve  a  design  of  the  “testing  pattern”  for  step-stress  tests. 

Note  that  testing  under  a  very  severe  environmental  condition,  say  Sk+i,  where  Sk  -<  -Sit-i-i, 
could  make  the  reaUzations  of  X^+i*  degenerate  at  0.  That  is,  what  would  be  observed  under 
5fc+i  are  instantajieous  faUures.  The  recording  of  instantaneous  failures  occurs  because  of  a  finite 
resolution  of  the  recording  device,  and  should  not  be  interpreted  as  Xjt+i.  s  0.  Thus,  in  what 
foUows,  Sk  is  chosen  to  be  the  most  severe  environmental  condition  for  which  the  realizations  of 
Xk»  are  not  identicaUy  zero. 
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2  OVERVIEW 


The  statistical  model,  which  includes  a  choice  of  the  parametric  family  of  distributions  for  F  and 
also  a  choice  of  the  time  transformation  function  is  described  in  Section  3.  A  discussion  of  some 
consequences  of  the  above  choices  is  also  included  in  Section  3.  Section  4  describes  the  main 
contribution  of  the  paper,  namely  the  development  of  a  paradigm  for  the  conduct  of  an  optimal 
accelerated  life  test  under  the  statistical  model  of  Section  3.  It  turns  out  that  whereas  a  prescription 
of  the  needed  paradigm  can,  in  principle,  be  formulated  in  an  appealing  manner  -  via  Shannon’s 
measure  of  information  -  progress  towards  its  implementation  calls  for  a  consideration  of  simplifying 
scenarios  and  special  cases.  A  simplifying  scenario  is  outlined  in  Section  4,  and  a  consideration 
of  some  special  cases  given  in  Section  5.  These  indicate  the  flavour  of  the  approach  amd  provide 
insights  about  the  more  general  situations  -  insights  which  are  otherwise  counter  intuitive.  The 
hope  here  is  that  continued  activities  along  the  suggested  lines  wUJ  lead  to  developments  which  will 
contribute  towards  the  generation  of  a  comprehensive  package  of  coherent  techniques  which  will 
help  address  the  practical  issues  raised  by  this  important  problem.  These  and  other  such  matters 
constitute  Section  6,  which  concludes  the  paper. 

3  THE  STATISTICAL  MODEL 

Suppose  that  Xij  is  lognormal  with  parameters  /i,-  and  af,  for  I  <  i  <  k,  and  1  <  j  <  r,-- 
This  assumption  is  not  atypical  in  life  testing;  furthermore,  it  facilitates  connection  with  the  well 
developed  Bayesian  theory  of  linear  normal  models.  For  the  time  transformation  function,  the 
Power  Law,  popular  in  both  biometry  and  reliability  [Sethuraman  and  Singpurwalla  (1982)]  is 
assumed.  Under  this  law.  tw'o  versions  of  the  time  transformation  function  appear  to  be  reasonable, 
one  in  terms  of  E{X,j),  and  the  other  in  terms  of  M{Xij),  the  median  of  Xij.  Specifically,  for 
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unknown  constants  C  >  0  and  P  >  0,  it  is  assumed  that 


E{X,j)  =  exp(/i,  +  -r)  = 


5f’ 


(1) 


or  that 


M{X,j)  =  exp(/x.)  = 


(2) 


UYij  =  logX,j,  then  Xij  having  a  lognormal  distribution  with  parameters  m  and  af  -  henceforth 
written  as  ~  -  implies  that  Yij  is  normal  with  mean  /x,-  and  variance  af  -  henceforth 

written  as  ''Y^j  ~  As  a  model  of  observations,  it  is  common  to  suppose  that 


Yij  =  jii  +  6,j,  where  e,_,  ~  jV(0,a,^); 


(3) 


then,  letting  a  =  logC,  6  =  —P,Vi  =  log  S','  and  taking  logarithms  in  (1)  and  (2),  the  above 


relationships  may  be  re-written,  under  (1),  as 


Yij  =  a  -f  6 Vi - -h  €ij. 


(4) 


and 

Yij  =  a  bVi  -f-  €ij  ,  (3) 

under  (2).  It  is  instructive  to  note  that  under  (4)  the  model  of  observations  for  the  actual  life  lengths 
takes  the  form  Xij  =  E{Xij)iij ,  and  under  (5)  the  form  Xij  =  M{Xij)€ij,  where  e,'j  —  exp(€,'j), 
that  is  iij  ~  A(0,cr?).  Such  relationships  imply  that  an  observed  life  length  is  its  mean  (median) 
multiplied  by  an  innovation  from  lognormal  distribution  with  median  1  and  mean  exp(cr?/2).  Fur¬ 
thermore,  when  af  is  assumed  and  r,-  =  nj  for  aU  i  known,  the  relationships  (4)  and  (5)  imply 
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T 

that  the  response  vector  Y,  where  Y  =  (Yn, _ Yi„,  Yji, . . . ,  Y2„2, . . . ,  Yti, . . . ,  )^,  has  the 

/ 

structure  of  a  linear  model  of  the  form 

Y  =  A^^e,  (6) 

where,  e  ~  iV(0,  S),/3^  =  (a,h)-  The  design  matrix.  A,  is  given  by 

1...1  1...1  ...  1...1 

= 

V2_^  ... 

-  ^2 

and  E  is  the  specified  covariance  matrix 

In, 

0 

0 

where  I^-  is  the  Uj  x  nj  identity  matrix. 

Since  E  is  assumed  known,  there  are  several  strategies  that  could  be  used  to  facilitate  a  speci¬ 
fication  of  the  af,  all  consistent  with  the  physical  aspects  of  the  life  testing  scenario.  In  particular, 
it  is  reasonable  to  expect  that  aj  decreases  with  5,-,  so  that  cr^  >  >  . . .  >  With  regard  to 

the  above,  several  possibilities  come  to  mind,  one  of  which  is  that  af  =  K/Si,  where  K  is  specified, 
or  that  <7,-  =  where  is  specified. 
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In  view  of  the  above  considerations  and  (6),  it  follows  that 

(y  I  A,AS)~iV(A/3,S).  (7) 

A  final  ingredient  that  remains  to  be  specified  is  a  prior  assumption  for  the  vector  (3.  For  this,  it 
is  reasonable  to  assume  that 

(/3|^o,So)~iV{/3o,Eo),  (8) 

where 

0Q  —  i<io,bo),  and  Eq  = 

^ab 

are  specified.  In  cases  where  P  is  close  to  zero  and  P  >  0,  the  assumption  that  b  =  —P  is  normally 
distributed  may  be  disturbing.  A  strategy  for  overcoming  this  in  this  scenario  is  to  make  very 
small. 

A  generalization  of  (8)  would  involve  a  distributional  assumption  on  /3o  leading  to  the  hierar¬ 
chical  model  of  Lindley  and  Smith  (1972).  Such  a  generalization  would  be  prompted  by  concerns 
about  the  validity  of  the  power  law  as  a  time  transformation  function,  or  about  the  appropriateness 
of  the  time  transformation  function  over  the  entire  range  of  stresses;  see  for  example,  Blackwell 
and  SingpurwaUa  (1988).  For  the  purposes  of  this  paper  we  proceed  with  assumption  (8). 

It  is  weU  known  [Lindley  and  Smith  (1972)],  that  under  (7)  and  (8),  the  posterior  distribution 
of  f3,  given  the  data  y  obtained  under  A,  is  of  the  form 

(/3|t/,A,E,/3o,Eo)~iV(Z?d,Z>),  (9) 
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where 


D  =  (A^L-^A  +  Eg  1)-'  and  d  =  +  2^ '/3o). 

Here  y  is  a  realization  of  Y  and  is  obtained  by  replacing  each  Yij  in  Y  by  its  realization  j/.y.  The 
use  of  (9)  in  designing  an  optimal  accelerated  test  is  explicated  in  the  next  section. 

4  DESIGN  OF  AN  OPTIMAL  TEST 

It  was  stated  that  the  aim  of  accelerated  testing  is  to  facilitate  inference  about  Vu  =  logX^. 
Variables  which  control  the  quality  of  inference  are  the  design  matrix  A,  the  censoring  values 

Ti - Pk,  and  the  test  truncation  times  ri...,rfc.  Factors  which  dictate  choices  of  the  above 

variables  are  the  costs  of  testing  -  where  costs  would  include  the  time  spent  waiting  for  the  results 
of  the  test.  Optimal  choices  of  the  design  variables  would  involve  a  trade-off  between  the  costs 
of  testing  and  the  quality  of  the  inference.  Thus  for  example,  testing  at  14  itself  (or  as  close  to 
it  as  is  possible)  would  not  only  be  time  consuming  but  w'ould  also  result  in  data  with  a  large 
variability;  testing  as  far  away  from  14  as  is  possible  would  be  expeditious  and  produce  data  with 
small  variability.  How'ever,  testing  at  very  large  values  away  from  14  could  result  in  recording  and 
instantaneous  failure  of  all  items  on  test,  providing  little,  if  any,  information  about  Y^.  Advantages 
of  testing  in  the  vicinity  of  14  are  a  savings  in  costs  due  to  the  non-failure  of  some  of  the  items  on 
test,  and  the  knowledge  that  the  observed  data  are  likely  to  be  more  representative  of  the  actual  life 
lengths  under  14,  than  data  that  would  be  obtained  when  testing  away  from  14.  This  latter  issue 
would  be  germane  if  there  were  concerns  associated  with  the  validity  of  the  time  transformation 
function  over  the  range  of  stresses  considered.  The  set-up  of  Section  3  assumes  that  the  power 
law  is  operative  over  the  entire  range  of  values  of  the  stresses,  and  this  diminishes  the  sense  of  the 
above  argument  for  testing  in  the  vicinity  of 
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4.1  Consideration  of  a  Simplifying  Scenario 


In  many  scientific  experiments,  particularly  those  involving  biomedical  scenarios,  the  cost  of  testing 
is  either  secondary,  as  compared  with  the  quality  of  inference,  or  is  the  same  at  all  test  levels;  thus  in 
what  follows  the  costs  of  testing  will  not  be  considered.  Furthermore,  for  the  purpose  of  Illustration, 
it  win  be  assumed  that  r,-  =  n,-,  and  that  the  r,’s  are  infinite  for  all  i.  These  assumptions  imply 
no  form  of  censoring  and  truncation,  leaving  cin  optimal  choice  of  the  design  matrix  A  as  the  only 
decision  variable.  It  will  be  shown  in  Section  4.3  that  an  optimal  choice  of  A  is  facilitated  by 
a  consideration  of  the  expected  gain  in  Shannon  information,  for  which  the  prior  and  posterior 
predictive  distributions  of  Yu  are  the  essential  ingredients. 

4.2  The  Predictive  Distributions 
Under  the  model  of  Section  3,  it  is  clear  that 


{Yu\Vu,al,l3)^  N{[1Vu]/3,<tI), 


and  we  require  the  following  predictive  distributions;  FulFu,  au,/3o,  So  and  FulVu,  cr^,  S,  y,/3o,  Sq. 
It  can  be  shown  that 


( 

\ 

(K  1  K,a2,/3o,So)~iV 

[lK]/3o,[lK]So 

1 

V 

Vu 

/ 

and  that 


/ 

■ 

\ 

(yjK,a2,/3o,So,S,t/)~.V 

[1  Vu]Bd,  [1  Vu]D 

1 

\ 

K 

/ 

(12) 


(13) 
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4.3  The  Design  Criterion 


The  criterion  used  here  to  obtain  the  optimal  design  is  to  choose  that  design  matrix  A  which 
maximizes  the  expected  gain  in  Shannon  information.  Lindley  (1956)  proposed  the  use  of  Shannon 
information  gain  between  prior  and  posterior  in  the  context  of  Bayesian  inference.  Stone  (1959a) 
explored  this  further  in  the  content  of  experimental  design.  Stone  (1959b)  discussed  its  use  in 
the  Bayes  hierarchical  modeling  framework  as  a  unifying  mechanism  for  previous  ad  hoc  classical 
criteria.  Smith  and  Verdinelli  (1980),  Kadane  and  VerdineUi  (1990)  discuss  the  application  of 
Shaimon  information  within  the  Bayes  hierarchical  modelling  framework.  Whittle  (1973)  discusses 
computational  techniques  for  the  computation  of  the  optimal  design.  Chaloner  and  Larntz  (1986) 
apply  these  techniques  to  nonlinear  models. 

Specifically,  in  our  case,  one  needs  to  maximize,  with  respect  to  A,  the  quantity 

Ey  J piVu  I  y)log 

where  Ey  denotes  expectation  with  respect  to  the  marginal  distribution  of  Y .  This  marginal 
distribution  depends  on  A,(3q,'Eo  and  S,  and  is  given  [Lindley  and  Smith  (1972)]  as 

(y  I  A.^o,So,S)  ~  N{A(3o,A1:oA'^  +  E).  (15) 


The  integral  in  (14)  is  the  KuUback-Leibler  divergence  between  the  two  distributions  (12)  and  (13). 
Note  that  if  Z;  ~  iV(m/,  sf),  1=1,2,  then  the  KuUback-Leibler  divergence  between  the  distributions 
of  Zi,  and  Z2  is  given  by 


4  ,  (mi  -  m2)^ 
2  '  ’ 


’1 


’1 
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The  mi  and  5i  associated  with  (12)  are 


=  [1  Kx]y3o  >  -Si  =  [1 


+  < 


and  these  do  not  depend  on  y  nor  on  A.  However,  the  m2  and  S2  associated  with  (13)  depend  on 
both  y  and  and  A,  respectively,  and  can  be  written  as 


m2  =  [1  V;][A^E-U  +  +  So '/3o], 


and 


s^  =  [i  v;][A^s-‘A  +  So'] 


1 

K 


+  crl. 


Substituting  the  above  in  (14)  results  in  the  need  to  find  that  A  for  which  the  following  expression 
is  maximized 

-  log  t( A,  Fu)  +  t{A,  Vu)  +  Ey{mji  -  77^)^.  (16) 


where, 


i(A,F„)  =  [lFjA^E-‘A  +  Eo']-' 


1 


+  K 


Clearly,  an  anlytical  implementation  of  the  above  maximization  poses  a  formidable  task.  However, 
a  simplification  of  some  of  the  terms  in  the  above  results  in  expressions  which  facilitate  numerical 
computations.  Specifically, 


Ey{m\  —  1712)^  =  Trace  <  Var(F)E  ^AI? 


1  F„ 


DA^E-^ 
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where  Var(y)  =  AEqA^  +  S  from  (15).  Furthermore, 


[A^E-U  +  Eo']~‘ 


1 

r 

-1 

ELi^ 

+ 

<rl  <7ab 

2.^1= 1 

2 

^ah 

and  the  ingredients  necessary  to  obtain  AHqA^  +  E  are  specified  in  Section  3. 

It  is  of  interest  to  note  that  were  inference  about  /3,  rather  than  Y^,  the  focus  of  attention,  then 
A  would  be  chosen  so  as  to  maximize,  see  Smith  and  Verdinelli  (1980). 


Var(/3  t  Y)  |-‘=  |  A^E"' A  +  Eo  |  , 


(17) 


rather  than  the  expression  (16)  given  before.  Furthermore  if  the  cost  of  the  testing  is  an  issue  of 
concern,  then  one  needs  to  specify  the  value  of  information  on  a  scale  comparable  with  monetary 
units. 


5  CONSIDERATION  OF  SPECIAL  CASES 

The  simplest  case  to  consider  is  the  one  which  involves  knowing  C,  say  C  =  1,  so  that  cq  =  0.  If 
for  aU  i,  i  =  1, . . . ,  k,  af  =  n,-  =  1,  and  =  k  =  I,  that  is  only  a  single  unit  can  be  subjected  to 
an  accelerated  test,  then  the  optimal  design  problem  finds  the  stress  at  which  the  item  should  be 
tested  -  in  the  vicinity  of  14,  or  at  Vjt,  the  largest  stress.  For  this  scenario  E  =  Eq  =  1.  Algebraic 
manipulation  shows  that  the  single  item  should  be  tested  at  14.  This  conclusion  may  be  contrary 
to  intuition,  because  many,  influenced  by  the  notion  of  a  warm  feeling,  would  test  in  the  vicinity 
of  14.  Furthermore,  even  if  more  than  one  item  were  available  for  testing,  one  would  test  all  the 
items  at  14,  and  the  same  would  also  be  true  if  af  were  to  be  decreasing  in  i. 

In  connection  with  the  above,  an  important  question  that  needs  to  be  addressed  is  that  pertain- 
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ing  to  the  number  of  items  that  should  be  tested  at  Vt;  that  is,  how  large  should  nk  be?  For  this, 
one  investigates  the  behavior  of  (16)  -  the  expected  gain  in  Shannon  information  -  as  a  function  of 
Uk,  and  chooses  that  value  of  nt  after  which  (16)  shows  little  or  no  improvement.  For  E  =  So  =  1, 
the  behavior  of  (16)  as  a  function  of  njt,  for  Vk  =  2,  4,  6  and  8  respectively,  and  0,  is  shown 
in  Figure  5.1.  It  is  interesting  to  note  that  the  optimum  value  of  Uk  decreases  in  14;  this  is  to  be 
expected  since  the  variance  of  the  observed  life-lengths  decreases  as  the  stress  increases. 

When  P  is  known,  say  P  =  1,  but  C  unknown,  a  situation  quite  common  when  dealing  with 
the  fatigue  life  of  ball  bearings  [loannides  and  Harris  (1985)]  suggests  that  a  transformation  of  the 
data  would  result  in  a  model  of  the  form  Yij  =  a  e,j,  where  Yij  =  Yij  —  bVi  +  under  (2)  and 
Yij  =  Yij  -  bVi,  under  (5).  Clearly,  if  af  decreases  in  t  -  as  is  usually  assumed,  then  one  would 
test  all  the  items  under  stress  Vk  where  the  variance  of  €kj  is  the  smallest.  Were  erf  assumed  to 
be  a  constant  for  all  values  of  i,  then  it  would  not  matter  at  which  stress  the  items  are  tested. 
The  above  conclusion  can  also  be  arrived  at  by  proceeding  formally  in  terms  of  majcimizing  (14). 
Finally,  the  determination  of  an  optimal  rik  would  be  based  on  a  plot  of  the  behavior  of  the  utility 
versus  Uk,  similar  to  that  of  Figure  5.1. 

When  both  C  and  P  are  unknown  -  as  is  often  the  case  -  and  when  only  one  item  is  available 
for  testing,  then  under  the  assumption  that  E  =  1  and  Eq  =  hi  it  can  be  shown  that  the  optimum 
stress  V*  at  which  testing  is  to  be  done  is  given  by  that  value  of  V ,  say,  which  attains 

m^  ^-logt(F)-K(K)-b 

where  t(F)  =  {{V  —  +  1))  -f  Interesting,  the  answer  depends  on  I4.  K  Fu  =  0  (that 

is,  Su  =  1),  then  V*  turns  out  to  be  Vu- 

In  figure  5.2,  we  show  a  plot  of  the  expected  gain  in  Shannon  information  as  a  function  of  VJ, 
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when  Vu  =  0.  Observe  that  the  expected  gain  in  Shannon  information  is  a  monotonically  decreasing 
function  of  Vi  implying  that  when  E  =  1,  it  is  optimum  to  test  at  =  0. 

Suppose  now,  that  in  the  scenario  given  above,  testing  at  Vu  is  not  possible  and  that  one  could 
test  anyw'here  in  the  range  Vi  =  1  through  =  6.  Suppose  the  experimenter  has  two  observations 
with  one  at  V  =  2.  He  wishes  to  choose  the  position  of  the  second  one  optimally.  In  Figure  5.3, 
we  show  a  plot  of  the  expected  gain  in  Shzmnon  information  as  a  function  of  V)i,  the  stress  for  the 
second  item.  It  is  interesting  to  note  that  testing  at  Vjt  =  6  would  be  preferable  to  testing  at  any 
V;  >  1.25,  but  that  testing  in  the  range  (1, 1.25)  is  preferable  to  14  =  6. 

If  ^  =  af,  where  erf  known  but  decreasing  in  t,  then,  from  (18),  the  choice  V*  would  be  a 
trade-off  between  the  “loss  in  utility  ”  due  to  a  large  variance  for  testing  in  the  vicinity  of  14,  versus 
the  “loss  in  utility”  due  to  our  lack  of  knowledge  about  C  and  P  which  gets  emphasized  when  we 
test  away  from  V^. 

Our  development  leads  to  some  interesting  conclusions  when  both  C  and  P  are  unknown  and 
when  we  are  allowed  to  test  2  items.  Under  the  assumption  that  S  =  So  =  =  1.  K  =  0  and 

testing  is  possible  at  14  =  0,  then  the  optimum  decision  would  be  to  test  both  the  items  at  14; 
however,  if  testing  at  14  were  not  possible  and  if  14  =  2  is  the  weakest  stress  at  which  testing  could 
be  done  is  V  =  2,  then  the  optimum  decision  would  be  to  test  one  item  at  Vi  =  2  and  the  other  at 
Vt  =  6.  Furthermore,  if  Vi  =  1  and  Vk  =  5,  then  the  optimum  decision  would  be  to  test  both  the 
items  at  Vi  =  1,  or  to  test  one  item  at  Vi  =  1  and  the  other  at  Vt  =  5,  both  scenarios  giving  the 
same  Shannon  information  gain. 

The  implication  of  the  above  conclusions  is  that  to  maximize  the  expected  gain  in  Shannon 
information,  as  defined  by  (14),  one  should  test,  if  possible,  at  or  in  the  vicinity  of  V^  to  gain 
direct  knowledge  about  14.  However,  if  testing  in  the  vicinity  of  Vu  is  not  possible,  then  one  should 
gain  knowledge  about  14  via  a  knowledge  about  C  and  P,  and  the  latter  is  optimally  obtained  by 
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spreading  out  the  test  points  as  far  apart  as  is  possible.  It  so  happens  that  when  Vi  —  1  and  —  5, 
it  does  not  matter  how  knowledge  about  is  obtained  -  by  direct  observation  in  the  vicinity  of 
Vu  or  by  knowledge  about  C  and  P. 

6  CONCLUSIONS 

It  is  evident  from  the  material  of  Section  5  that  the  development  given  here  enables  one  to  address 
many  of  the  practical  issues  posed  by  the  problem  of  designing  accelerated  tests.  The  special  cases 
of  Section  5  have  assumed  that  af  is  a  constant.  We  have  focused  on  the  unknowns  in  the  time 
transformation  function.  Clearly,  an  important  parameter  to  learn  about  is  (Ty  itself.  Furthermore, 
we  have  not  considered  the  case  of  time  truncation  and  censoring,  nor  have  we  addressed  the  issue 
of  choosing  optimal  sample  sizes.  However,  all  of  the  above  considerations  can  in  principle  be 
undertaken  -  the  only  limitation  being  a  question  of  details.  All  the  same,  our  approach  does  bring 
out  matters  that  are  not  otherwise  intuitive  and  the  special  cases  give  us  many  clues  about  what 
to  expect  and  what  direction  to  proceed  when  considering  the  more  general  scenarios. 

Typically,  in  linear  model  with  k  parameters,  designs  concentrate  on  k  points.  This  is  not 
generally  true  in  nonlinear  situations  (Chaloner  and  Larntz,  1986).  In  our  case  we  have  k  =  2. 
Stone  (1959)  considers  the  number  of  points  required  to  meiximise  Shannon  information  when 
estimation  is  the  aim.  The  extension  to  the  predictive  case  requires  future  research. 

Appendix 

Suppose  that  the  experimenter  is  faced  with  a  decision  problem  for  the  random  variable  Yu, 
with  realisation,  y^-  Let  U{yu,d)  denote  the  utility  function,  where  d  ^  D,  the  decision  space. 
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Define  the  information  gained,  E{I{Y))  through  observing  the  random  variable  Y  by 


E{IiY))  =  Ey 


sup  Ey^\y  {UiYu.d)) 
deD 


-  sup  E’vu  {U(Yu,d)) 
deD 


(19) 


If  D  is  the  space  of  probability  distributions  for  prediction  at  the  point  Vu  aJid  utility  function, 


for  reporting  probability  densities,  is  the  logarthimic  function,  that  is,  Z7(yu,  d{-))  =  log  d(j/u),  then. 


sup  Ey^\y  (UiYu^d))  =  /  piyu\Y)logp{y^\Y)dy^ 
dQD  J 


and 


sup  {U{Y^,d))  =  /  p(yu)logp(l/u)d2/u  • 
deD  J 


Hence  the  information  gain  is 


E{I{Y))  =  E^y,y.) 


P(n|r)\ 

piY.)  J 


which  is  the  Shannon  information  between  the  posterior  predictive  density  at  Y^  to  the  prior 
predictive  density  at  Y^.  If  estimation  under  a  quadratic  loss  function  is  the  aim  then  the  criteria 
of  maximising  information  gained  becomes  that  of  minimising  posterior  variance  at  the  point  V^, 
that  is  criteria  (17). 
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